Cloning and identification of a two component signal transducing regulatory system from Bacteroides fragilis

ABSTRACT

This invention relates to a purified isolated DNA fragment of Bacteroides fragilis comprising a sequence for an operon containing two genes designated rprX and rprY. These genes encode two signal transducing regulatory proteins designated RprX and RprY. This invention further relates to the proteins RprX and RprY encoded by the operon. RprX and RprY affect the normal regulation of OmpF by OmpR and EnvZ.

This is a division of application Ser. No. 08/023,764, filed Feb. 26, 1993.

FIELD OF THE INVENTION

This invention relates to a purified isolated DNA fragment of Bacteroides fragilis comprising a sequence for an operon containing two genes designated rprX and rprY. These genes encode two signal transducing regulatory proteins designated RprX and RprY. This invention further relates to the proteins RprX and RprY encoded by the operon.

BACKGROUND OF THE INVENTION

All cells are able to monitor and make appropriate adaptive responses to changes in their environment. Such adaptive responses in bacteria are often regulated by a multi-component signal transducing system. When exposed to the appropriate stimuli, a sensory receptor, generally an inner membrane protein, is activated to autophosphorylate a histidine residue. This high energy phosphate is transferred from the histidine to an aspartate residue on the second protein in the regulatory cascade, the regulatory response protein. It is the phosphorylated regulatory response protein that then mediates the adaptive response either by interacting with DNA, altering transcription, or with a specific protein(s), altering its activity. In addition to regulation at the level of phosphorylation, the amount of phosphorylated regulatory response protein is further controlled by associated phosphatases.

Regulatory systems of this nature have been identified in over twelve diverse bacterial genera and have been found to regulate a variety of cellular processes including virulence and pathogenic determinants. (For a review see Bibliography entries 1,2). In Escherichia coli (E. coli), it is predicted that there may be 50 different pairs of signal transducing proteins. Some of these pairs include CheY, CheA and CheB, which are involved in the gliding versus tumbling response of bacteria in chemotaxis; PhoR and PhoB, which regulate phosphorus assimilation; and EnvZ and OmpR, which regulate outer membrane porin protein expression (2,3,4,5,6).

Bacteroides fragilis is the major anaerobic bacterium comprising the bowel flora of man. Bacteroides fragilis is also one of the major anaerobic bacteria isolated from anaerobic or mixed infections. As such, this bacterium produces various virulence factors which are involved in colonization and invasion of the organism. These factors include activities such as a nuraminidase.

This invention relates to the cloning and identification of a pair of signal transducing regulatory proteins cloned from Bacteroides fragilis, as well as to the phenotypes that expression of the regulatory proteins imparts upon E. coli.

SUMMARY OF THE INVENTION

A DNA fragment is cloned from Bacteroides fragilis that bestows low level tetracycline resistance to E. coli strains harboring the cloned fragment on a multi-copy plasmid. The tetracycline resistance determinant is localized to a 4.3 kilobase (kb) BglII-PstI subfragment of the original clone. DNA sequence analysis of this purified isolated fragment reveals that it contains an operon encoding two proteins, one of 519 amino acids, designated RprX, and a second of 233 or 236 amino acids, designated RprY. Protein sequence analysis reveals that the two proteins share sequence identity with a family of multi-component signal transducing regulatory proteins identified from many diverse bacterial genera.

RprX shares identity with the first component of the regulatory system, the histidine protein kinase receptor (for example: CpxA, PhoR, and CheA). RprY shares identity with the second member of the regulatory protein pair, the regulatory response protein, (for example: OmpR, PhoB, VirG, SpoOF and NtrC). Expression of these RprX and RprY proteins from a multi-copy plasmid vector in E. coil results in a decrease in the level of the outer membrane porin protein OmpF and an increase in the level of the outer membrane porin protein OmpC. The decrease in OmpF levels correlates with and may be the cause of the increase in tetracycline resistance.

Regulation of the levels of OmpF and OmpC is normally controlled by a multi-component signal transducing regulatory pair of proteins, EnvZ and OmpR. The effect RprX and RprY have on OmpF expression is shown to be mediated at the level of transcription. Thus, RprX and RprY are affecting the normal regulation of OmpF by OmpR and EnvZ.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A and 1B depict the restriction endonuclease map of the EcoRI fragments of the cloned plasmid pCLL2230 and localization of the tetracycline resistance locus. Panel A depicts the restriction endonuclease cleavage sites within the cloned DNA, which are shown above the line. The BglII restriction site shown below the line is part of the vector polylinker. The tetracycline resistance locus lies within the BglII-PstI fragment shown. Panel B depicts plasmids harboring subfragments of the original clone and indicates their ability to confer tetracycline resistance. Restriction enzyme abbreviations: B, BglII; E, EcoRI; N, NdeI; P, PstI; and S, StuI.

FIG. 2 depicts the open reading frames and restriction endonuclease sites identified within the BglII-PstI fragment. Shown below the depicted BglII-PstI fragment are the location and direction of transcription of the four open reading frames (ORFs) discussed in the specification. The arrows shown above the line represent other potential ORFs and their direction of transcription. The lollypop depicts the location of the 24 base pair inverted repeat. The location of the IS1 insertion within rprX is indicated. The DNA deleted in the BglII-HindIII deletion (ΔHindIII) is also indicated.

FIGS. 3A-F depict the DNA sequence of the BglII-PstI fragment from nucleotide 356 to 3006. This corresponds to the nucleotides numbered 1 to 2651 in SEQ ID NO. 1. Below the sequence is the translated amino acid sequence of the two large open reading frames encoding RprX and RprY. The first ATG codon of each open reading frame is underlined. The * indicates the stop codon terminating each polypeptide. Potential -10, -35, and Shine-Dalgarno (S-D) sequences are indicated above their respective sequences. The two copies of a 12 base pair repeated sequence upstream of the RprX coding sequence are underlined. The 24 base pair inverted repeat at the end of the coding sequence for RprY is underlined. The location of the ISl insertion within RprX is indicated with an arrowhead between nucleotides 2043 and 2044 (numbered 1688 and 1689 in SEQ ID NO. 1). The HindIII site (nucleotides 1037-1042 (SEQ ID NO.1 682-687)) and two SspI sites (nucleotides 2246-2251 (SEQ ID NO. 1 1891-1896) and nucleotides 2955-2960 (SEQ ID NO. 1 2600-2605)) are indicated above the respective cleavage recognition sequences.

FIG. 4 depicts a comparison of the conserved amino acid sequence within the homologous domains of several histidine kinase protein kinase proteins to RprX (SEQ ID NOS. 30-33, from relevant portions of SEQ ID NO. 2). Totally conserved amino acids are indicated with an * in the top line. Among these conserved residues is the proposed phosphorylated histidine (1,7,8). The numbers in brackets indicate the number of residues separating the conserved regions. Protein sequences are from the following sources: CpxA from E. coli (SEQ ID NOS. 4-7)(9), PhoR from E. coli (SEQ ID NOS. 8-11)(4), CheA from E. coli (SEQ ID NOS. 12-15)(6), VirA from Agrobacterium tumefaciens (SEQ ID NOS. 16-19)(10), and NtrB from Bradyrhizobium parasponia (SEQ ID NOS. 20-23)(11).

FIG. 5 depicts a comparison of the conserved amino acid sequence within the homologous N-terminal domains of several regulatory response proteins to RprY (SEQ ID NOS. 34-38, from relevant portions of SEQ ID NO. 3). The three highly conserved residues are depicted with an * in the top line. The second conserved aspartate has been demonstrated to be the phosphorylated residue for several of the regulatory response proteins (7,8). Residues corresponding to those which comprise the hydrophobic core of CheY are underlined (1,2). Protein sequences are from the following sources: OmpR from E. coli (SEQ ID NO. 24)(3), VirG from Agrobacterium tumefaciens (SEQ ID NO. 25)(12), NtrC from Bradyrhizobium parasponia (SEQ ID NO. 26)(11), PhoB from E. coli (SEQ ID NO. 27)(4), SpoOF from Bacillus subtilis (SEQ ID NO. 28)(13), and CheY from E. coli (SEQ ID NO. 29)(1,14).

FIG. 6 depicts a hydropathy plot of RprX.

FIG. 7 depicts an ethidium bromide stained agarose gel of BglII-SspI restricted plasmid DNA isolated from small and large colony forming pCLL2233 containing cells. Lanes A-G, plasmid isolated from 7 independent large colony formers (see text); lane H, plasmid isolated from a small colony former; lane I, pCLL2300 vector plasmid. Denoted to the right are the band position and size of the vector and insert fragments. The bands representing fragments of the insert DNA are indicated with an *. The arrowheads indicate the position of new bands identified among lanes A-G.

FIG. 8 depicts a Coomassie blue stained polyacrylamide gel of cellular fractions of DH5α harboring either pCLL2300 (vector); pCLL2233 two independent transformed isolates, isolate 1 and isolate 2; pCLL2233 containing the IS1 insertion known to reside within the coding sequence for RprX on the 2.2 kb BglII-SspI fragment, IS 2.2; pCLL2233 with a DNA insertion within the 0.7 kb SspI fragment, IS 0.7; or pCLL2233 containing the BglII-HindIII deletion, ΔHindIII. WC, whole cell lysate; S, soluble cell fraction; and M, outer membrane fraction. The positions of OmpF, OmpC, and OmpA are indicated.

DETAILED DESCRIPTION OF THE INVENTION

The purified isolated DNA fragment of B. fragilis of this invention, which comprises a sequence for an operon containing two genes designated rprX and rprY, is obtained from an existing clone bank of B. fragilis DNA. A total chromosomal clone bank of B. fragilis DNA had been created previously (15,16) by digesting B. fragilis chromosomal DNA with EcoRI, cloning into pCLL2300 (a kanamycin resistance conferring vector), and transforming in E. coli selecting for kanamycin resistance. The original clone bank is constructed in E. coli DH5α (F-, endA1, hsdR17(r_(k) -,m_(k) +), supE44, thi-1, recA1, gyrA96, relA1 Δ (argF-lacZYA)U169, Φ80dlacZΔM15) (Bethesda Research Laboratories, Inc., Gaithersburg, Md.). Unless otherwise indicated, all isolates studied are derivatives of E. coli DH5α.

Tetracycline resistance conferring plasmids are identified by plating approximately 10⁵ -10⁶ cells from the pool of 20,000 transformed colonies onto kanamycin and tetracycline (3 μg/ml) containing plates and incubating under anaerobic conditions at 37° C. Tetracycline resistance colonies appear as small colonies after two to three days of incubation.

All experiments are performed using LB medium (17) supplemented, when required, with kanamycin (25 μg/ml) and tetracycline. All cultures are incubated at 37° C. Anaerobic growth conditions are achieved with the use of a BBL GasPak jar (Becton Dickinson and Co., Cockeysville, Md.). Several hundred small colonies are visible. Plasmid DNA is isolated from several of these tetracycline resistant colonies and the DNA insert is characterized.

Endonuclease restriction analysis is then performed. Restriction enzymes, calf-intestinal phosphatase and T4 DNA ligase are obtained from Boehringer Mannheim Biochemicals (Indianapolis, Ind.) and used according to the manufacturer's instructions (18). Standard recombinant DNA techniques are performed according to the protocols in Sambrook et al. (18).

This analysis reveals that all of the plasmids examined harbor two large EcoRI fragments, each approximately 5 kb in size. DNA sequence analysis subsequently reveals a third fragment of 75 base pairs (the EcoRI-EcoRI fragment shown at the right side of pCLL2230 in FIG. 1A). The identification of the same size EcoRI fragments in all the plasmids studied suggests that the tetracycline resistant colonies are siblings, originating from one initial transformed cell. The plasmids are transformed into a fresh background and found to be capable of conferring low level tetracycline resistance, suggesting that a tetracycline resistance determinant is cloned on the plasmid.

One plasmid, designated pCLL2230, is selected for further studies. This plasmid contains two EcoRI fragments, each approximately 5 kb in length (FIG. 1A). Southern hybridization analysis using sequences from the two EcoRI cloned fragments to probe B. fragilis DNA indicates that both EcoRI fragments are of B. fragilis origin and are not contiguous on the chromosome.

To facilitate localization of the putative tetracycline resistance determinant, a restriction map is generated (FIG. 1). Utilizing this map, various regions of the cloned DNA fragments are selected for subcloning and determination of their ability to confer tetracycline resistance (FIG. 1). From this analysis, the low level tetracycline resistance locus is localized to a 4.2 kb BglII-PstI fragment, which is a subfragment of the slightly smaller of the two approximately 5 kb EcoRI fragments.

E. coli transformed with a multi-copy plasmid harboring the tetracycline resistance locus on either the BglII fragment (pCLL2233) or the slightly larger PstI fragment (pCLL2234) (FIG. 1B) grows slowly. When grown on LB medium, isolates harboring pCLL2233 have a doubling time of 105 minutes versus 45 minutes for an isogenic strain harboring the vector with no insert. Cells harboring pCLL2233 or pCLL2234 also form small colonies, requiring two days at 37° C. to form normal sized colonies on rich medium. The tetracycline resistance, although detectable, is difficult to score when cells harbor these subfragments of the original clone. The resistance is also unstable and easily lost, in the absence of selection, without concomitant loss of the plasmid.

The entire BglII fragment (pCLL2233) is sequenced to aid in the identification of the mechanism of the tetracycline resistance conferred by the fragment. To accomplish this, the BglII fragment is subcloned and both DNA strands are sequenced. DNA sequence analysis is performed using the Sanger dideoxy method (19) with the Sequenase™ kit (United States Biochemical, Cleveland, Ohio) used according to the manufacturer's directions.

To facilitate sequencing, the BglII fragment is subcloned. A BglII-HindIII, a HtndIII-HindIII and a BglII-NsiI fragment are cloned into pUC118 and pUC119 (20) (FIG. 2). One of the HindIII restriction sites is part of the polylinker into which the BglII fragment is cloned. Sequencing is performed on either single stranded DNA, prepared using the helper phage M13K07 (20), or double stranded plasmid DNA, prepared by the boiling mini prep method (18).

Both strands of the BglII-HindIII and HindIII-HindIII fragments are sequenced in their entirety. The BglII-NsiI fragment is sequenced in the region corresponding to the HindIII junction of the other two fragments. The primers utilized are the M13-40 universal primer (New England Biolabs, Beverly, Mass.) and additional synthetic obligonucleotides spaced 150-250 bases apart and complementary to determined DNA sequence. DNA sequence analysis, translation, protein data base searches, and protein sequence comparisons are performed using the DNA Star™ (DNA Star Inc., London, England) computer program.

The DNA sequence shows several interesting features, as depicted in FIGS. 2 and 3. There are two open reading frames (ORFs): The first ORF encodes a protein of 519 amino acids. The second ORF encodes a smaller protein of 233 or 236 amino acids (there are two potential ATG initiation codons for this protein, resulting in a protein of 233 or 236 amino acids depending on which ATG is utilized to initiate translation). From the DNA sequence, the two genes appear to be coordinately expressed from the same promoter and, therefore, compose an operon.

Due to their homology with known regulatory proteins (see below), the 519 and 233 or 236 amino acid proteins are designated RprX and RprY (Regulatory protein X and Y), respectively. Their respective genes are designated rPrX and rprY. The coding sequences of rprX and rprY overlap by five base pairs (nucleotides 2173-2177 in FIG. 3; numbered 1818-1822 in SEQ ID NO. 1) if the first ATG in rprY is utilized as the initiation codon. Alternatively, a second ATG codon, located five base pairs downstream from the stop codon for rprX, could also function as the translation initiation codon for rprY.

The locations of the two ORFs within the BglII-PstI fragment are as follows: FIG. 3 depicts the operon and flanking regions as nucleotides numbered 356-3006 (numbered 1-2651 in SEQ ID NO. 1). The rprX gene is nucleotides 621-2177 in FIG. 3 (numbered 266-1822 within SEQ ID NO. 1). If the first possible ATG in rprY is the initiation codon, then the rprY gene is nucleotides 2173-2880 in FIG. 3 (numbered 1818-2525 within SEQ ID NO. 1). If the second possible ATG in rprY is the initiation codon, then the rprY gene is nucleotides 2182-2880 in FIG. 3 (numbered 1827-2525 within SEQ ID NO. 1).

Preceding the rprX coding sequence are several potential -10 and -35 sequences, based upon E. coli consensus sequences (21), and a potential Shine-Dalgarno sequence, based upon the B. fragilis 16S rRNA sequence (22). No potential Shine-Dalgarno sequence is identified upstream of either ATG codon of the rprY coding sequence.

Within the sequences directly upstream of the rprX coding sequence is a tandem repeat of 12 base pairs. Immediately following the rprY coding sequence is a 24 base pair perfect inverted repeat separated by four base pairs that could function in transcription termination (23). Within the 1.5 kb downstream of the operon, there are several small ORFs encoding proteins of 81 to 169 amino acids. There are also ORFs extending beyond both ends of the DNA fragment. It is to be noted that within each of the rprX and rprY coding sequences is a small ORF, designated ORF3, encoding a protein of 170 amino acids, and ORF4, encoding a protein of 149 amino acids, respectively. These small ORFs are each in the same frame as the proteins encoded by rprX and rprY, but are transcribed from the opposite strand. There is no indication that ORF 3 and ORF 4 are functional.

The predicted protein sequences of both RprX and RprY are compared with the GenBank protein data base. RprX shares sequence identity with CpxA (26.9%), PhoR (20.4%), and CheA (18.6%), and weaker identity with VirA and NtrB. All of these proteins are histidine protein kinases and represent one component of a multi-component signal transducing regulatory system. The regions of greatest identity between proteins of this family are the histidine kinase domain, generally located near the carboxy terminal end of the protein, and the region surrounding the phosphorylated histidine residue. The amino acid sequence N-(then an intervening 15 to 45 residues) DXGXG-(SEQ ID NOS. 39) (then an intervening 20 to 50 residues)-GXG is conserved among all the histidine protein kinases (1,2,8). With the exception of CheA and FrzE (another histidine protein kinase protein), the proposed autophosphorylated histidine residue lies approximately 100 amino acids upstream of the conserved asparagine (1). RprX contains all of these features (FIG. 4).

RprY shares sequence identity with OmpR (32.6%), PhoB (28.7%), VirG (28.7%), SpoOF (27.7%), and NtrC (21.0%) and weaker identity with CheY, SpoOA, and several other regulatory response proteins. These proteins comprise the second protein of the multi-component signal transducing regulatory systems. The average homology among this class of proteins is 20-30%, with the strongest homology located in the amino terminal half of the protein (1,2). Within this region, there are three absolutely conserved amino acids: two aspartates, one being the phosphorylated aspartate, and a lysine (1,2,7,8). There are also several conserved stretches of hydrophobic amino acids (1,2). Both aspartates, the lysine and the hydrophobic regions are present within RprY (FIG. 5).

Thus, these amino acid sequence comparisons of RprX and RprY to other known proteins clearly reveal that the two proteins share strong sequence identity with a family of multi-component signal transducing regulatory proteins. Detailed analysis of the conserved amino acids and overall protein structure of the two proteins with other known regulatory proteins indicate that RprX and RprY contain all the features characteristic of known histidine protein kinases and regulatory response proteins, respectively. From these findings it is hypothesized that RprX and RprY represent two proteins that together form a pair of regulatory proteins. This is then tested by a series of characterization studies discussed below.

The signal transducing regulatory proteins RprX and RprY of this invention are expressed using established recombinant DNA methods. Suitable host organisms include bacteria, viruses, yeast, insect or mammalian cell lines, as well as other conventional organisms. For example, the E. coli strain DH5α is transfected with either of the multi-copy plasmids designated pCLL2230 or pCLL2233, which include the rDrX and rprY genes. The host cell is then cultured under conditions which permit the expression of the RprX and RprY proteins.

Samples of an E. coli strain DH5α containing the multi-copy plasmid designated pCLL2230 (which includes the rprX and rprY genes of this invention) were deposited by Applicant on Feb. 22, 1993, with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Md. 20852, U.S.A., and have been assigned ATCC accession number 69246.

The present invention further comprises DNA sequences which, by virtue of the redundancy of the genetic code, are biologically equivalent to the DNA sequences described specifically herein which encode for the regulatory proteins, that is, these other DNA sequences are characterized by nucleotide sequences which differ from those set forth herein, but which encode regulatory proteins having the same amino acid sequences as those encoded by the DNA sequences set forth herein.

In particular, the invention contemplates those purified isolated DNA fragments containing sequences encoding the regulatory proteins which are sufficiently duplicative of the sequences of the operon contained within SEQ ID NO. 1, so as to permit hybridization therewith under standard high stringency Southern hybridization conditions, such as those described in Sambrook et al. (18), as well as the biologically active regulatory proteins produced thereby.

This invention also comprises DNA sequences which encode amino acid sequences which differ from those of the described regulatory proteins as set forth in SEQ ID NOS. 2 and 3, but which are the biological equivalent to those described for the regulatory proteins. Such amino acid sequences may be said to be biologically equivalent to those of the regulatory proteins if their sequences differ only by minor deletions from or conservative substitutions to the regulatory proteins, such that the tertiary configurations of the sequences are essentially unchanged from those of the regulatory proteins.

For example, a codon for the amino acid alanine, a hydrophobic amino acid, may be substituted by a codon encoding another less hydrophobic residue, such as valine, leucine or isoleucine. Similarly, changes which result in substitution of one negatively charged residue for another, such as aspartic acid for glutamic acid, or one positively charged residue for another, such as lysine for arginine, as well as changes based on similarities of residues in their hydropathic index, can also be expected to produce a biologically equivalent product. Nucleotide changes which result in alteration of the N-terminal or C-terminal portions of the molecule would also not be expected to alter the activity of the regulatory proteins. It may also be desirable to eliminiate one or more of the cysteines present in the sequence, as the presence of cystsines may result in the undesirable formation of multimers when the protein is produced recombinantly, thereby complicating the purification and crystallization processes. Each of the proposed modifications is well within the routine skill in the art, as is determination of retention of activity of the encoded regulatory proteins. Therefore, where the terms "gene", "DNA encoding sequence" and "sequence" are used in either the specification or the claims, each will be understood to encompass all such modifications and variations which result in the production of a biologically equivalent regulatory protein.

A series of characterization studies is carried out regarding the structure and function of RprX and RprY. First, the hydrophobicity of RprX is analyzed. Many of the signal transducing proteins similar to RprX are inner membrane proteins. They contain two membrane spanning regions, a periplasmic domain containing the sensory receptor and a carboxy terminal cytoplasmic domain harboring the histidine protein kinase activity, as well as the autophosphorylated histidine residue. A hydropathy plot is performed on RprX to predict potential membrane spanning and soluble domains utilizing the algorithms of Kyte and Doolittle (24) (FIG. 6). Interpretation of the plot leads to the prediction that there are two potential membrane spanning regions: one between the amino acids 4 and 26, and a second between amino acids 252 and 281. The intervening regions between these segments are hydrophilic and predicted to be soluble domains, with the more amino terminal domain residing in the periplasm and the carboxy terminal domain residing in the cytoplasm. This is in agreement with the structure of other known histidine protein kinase receptors and the protein sequence alignment data.

Next, disruption of the rprX and rprY coding sequences is studied. E. coli cells harboring the BglII DNA fragment encoding RprX and RprY display low level tetracycline resistance, grow slowly and form small colonies on rich media. Both the slow growth/small colony phenotype and the ability to confer tetracycline resistance are dependent upon expression of RprY and perhaps RprX.

Analysis of the initial DNA sequence of the BglII fragment indicates that sometime during subcloning and propagation of the plasmid for sequencing, an insertion sequence element designated IS1 had been inserted within the coding sequence of rprX. An insertion element is a small transposable element capable of inserting, more or less randomly, into DNA.

The location of the IS1 insertion is depicted in FIGS. 2 and 3. Cells transformed with pCLL2233 containing the IS1 insertion no longer exhibit the slow growth/small colony phenotype or low level tetracycline resistance of cells harboring pCLL2233, carrying a wild-type BglII fragment.

Thus, it appears that expression of RprX and/or RprY is responsible for the phenotypes associated with harboring the BglII DNA fragment. It is observed that, when pCLL2233 plasmid DNA is used to transform E. coli cells, between 1% and up to 50% of the transformed colonies are of normal size while the remainder are small or pin-point sized colonies. If expression of RprX and/or RprY is responsible for the small colony phenotype, these larger sized colony transformants might contain pCLL2233 plasmids in which the rprXY operon is disrupted. This could be easily accomplished by the insertion of a DNA sequence (i.e. an insertion sequence (IS) element) within the rprXY operon.

Seven independent large colony forming pCLL2233 transformed isolates are examined. Large colony variants are obtained by growing pCLL2233 harboring cells overnight, isolating plasmid DNA from them, transforming the DNA into a fresh background, and identifying large colony forming transformants. Plasmid DNA is obtained from these large colony forming isolates and subjected to restriction analysis. Digestion of pCLL2233 with BglII and SspI results in the formation of three subfragments comprising the cloned BglII fragment, a 2.2 kb fragment containing rprX and less than 80 base pairs of rprY coding sequence, a 0.7 kb fragment containing the remainder of the rprY coding sequence plus 75 bp downstream of the rprY coding sequence including the 24 bp inverted repeat, and a 1.5 kb fragment comprising the remainder of the BglII fragment (FIG. 2). The subfragments are visualized by an ethidium bromide stained agarose gel.

When plasmid DNA from the seven large colony forming isolates is examined, either the 2.2 or 0.7 kb band is lost and a new, slower migrating band appears (FIG. 7). Four of the plasmids examined show a loss of the 2.2 kb fragment (lanes A, E, F, and G, FIG. 7). The 0.7 kb fragment is missing from the other three plasmids (lanes B, C, and D, FIG. 7). E. coli transformed with any of these seven plasmids no longer exhibits the small colony phenotype or increased tetracycline resistance.

These results indicate that DNA insertions within the 2.2 kb BglII-SspI fragment, one of which is known to lie within the coding sequence for RprX, or the 0.7 kb SspI-SspI fragment, 90% of which is RprY coding sequence, eliminate all the observed phenotypes. This strongly suggests that RprY and perhaps RprX expression is responsible for the observed phenotypes.

Studies utilizing a DNA insertion within the 2.2 kb fragment use the original IS1 insertion within the rprX coding sequence. The exact location of the DNA insertions within the 0.7 kb fragment is not determined.

Because the only ORFs within the 0.7 kb fragment are rprY and ORF4, and their coding sequence comprises up to 90% of the 0.7 kb DNA sequence, any insertion within this fragment is likely to disrupt one or both of these two ORFs. However, it cannot be ruled out from these DNA insertions alone that one or both of the two small ORFs, ORF3 or ORF4, whose coding sequences overlap those of RprX and RprY, respectively, are responsible for the observed phenotypes.

Therefore, in addition to the DNA insertions, a construct deleting the upstream and amino terminal coding sequence of rprX is created. DNA sequences between the BglII and the HindIII restriction sites (ΔHindIII) (FIG. 2) are removed. This deletion does not disrupt the upstream or coding sequences of ORF3 or ORF4, but does delete the upstream DNA and 5' coding region of rprX. E. coli transformed with pCLL2233 ΔHindIII displays normal growth characteristics and exhibits no increased tetracycline resistance. Therefore, the deletion results in the elimination of expression of RprX and RprY and, consequently, the elimination of all phenotypes associated with harboring the BglII fragment.

This indicates that expression of RprY and perhaps RprX, and not the polypeptides encoded by ORF3 and ORF4, is responsible for the observed phenotypes. Since rprX and rprY form an operon, disruption of rprX would have a polar effect on the expression of rprY. Thus, it cannot be determined if RprX contributes to the observed phenotypes. Expression of RprY alone may be sufficient to elicit the observed phenotypes.

Next, the effect of the cloned BglII fragment on the level of OmpF and OmpC in E. coli is studied. The observation that the slow growth/small colony phenotype could be eliminated by DNA insertions within rprX and rprY indicates that expression of one or both of the Rpr proteins has a negative effect on cell growth. Among the different types of proteins whose expression is regulated by a multi-component signal transducing regulatory system are outer membrane proteins, for example, the porin proteins OmpF and OmpC by the EnvZ/OmpR regulatory system, and PhoE by the PhoB/PhoR regulatory system (1). Therefore, the possibility that the B. fragilis regulatory proteins may be affecting the expression of E. coli outer membrane proteins is investigated.

A series of E. coli DH5α cells is transformed with, respectively, pCLL2300 (labelled vector in FIG. 8), pCLL2233 two independent transformed isolates (labelled isolate 1 and isolate 2), pCLL2233 with the IS1 element within the 2.2 kb BglII-SspI fragment (labelled IS 2.2), pCLL2233 with a DNA insertion within the 0.7 kb SspI fragment (labelled IS 0.7), or pCLL2233 with the ΔHindIII deletion (labelled ΔHindIII).

These cells are fractionated and SDS polyacrylamide gel electrophoresis is performed as follows: Overnight cultures of the appropriate isolates are diluted 1:50 into 200 mls of fresh LB medium. Cultures are grown with aeration at 37° C. to a final OD₅₅₀ of 0.6-0.7. Cells are harvested, washed once with 10 mM HEPES, pH 7.4, and resuspended in 2.5 mls of 10 mM HEPES, pH 7.4. The cells are broken by two passes through a French press at 15,000 psi. After breaking, MgCl₂ is added to 1 mM. One tenth ml of lysed cells is removed, representing the whole cell fraction. The remainder is centrifuged for 30 minutes at 20,000×g at 4° C.

The supernatant, soluble fraction is decanted and saved. The pellet is resuspended in 5 mls of water and centrifuged again for 30 minutes at 20,000×g at 4° C. The final pellet, sacculas or outer membrane prep, is resuspended in 2.4 mls 10 mM HEPES, pH 7.4. Each fraction is mixed with an equal volume of 2× sample buffer boiled for five minutes, loaded on a 10% polyacrylamide gel and electrophoresed as previously described (25). Utilizing these gel conditions (no urea), the OmpF and OmpC protein bands separate. The OmpC protein migrates faster and, therefore, appears below the OmpF band (see FIG. 8).

It is observed that cells transformed with pCLL2300 (vector) contain nearly equal amounts of OmpF and OmpC in their outer membrane when grown on LB medium broth. In contrast, cells transformed with pCLL2233 contain little or no detectable OmpF in their outer membrane and show increased levels of OmpC under the same growth conditions. Disruption of rprX and/or rprY by DNA insertion, DNA insertions within the 2.2 kb or 0.7 kb fragments respectively, restores the levels of OmpF and OmpC to those of cells harboring vector alone. Deletion of the upstream and amino terminal coding sequences of rprX (ΔHindIII deletion) also restores the levels of OmpF and OmpC to those of cells harboring vector alone (FIG. 8). Immune precipitation of OmpF from cellular extracts of isolates harboring pCLL2300 (vector), pCLL2233 with a DNA insertion within the 2.2 kb or 0.7 kb fragments, or pCLL2233 with the ΔHindIII deletion reveals that all four isolates synthesize OmpF. In contrast, no OmpF can be detected by immune precipitation from cellular extracts of cells which harbor a known wild-type pCLL2233 plasmid.

Regulation of the outer membrane porin proteins OmpF and OmpC normally occurs at the level of transcription. Transcriptional regulation is coordinated through EnvZ and OmpR and correlates with the levels of OmpC and OmpF in the outer membrane. Since RprX and RprY appear to represent regulatory proteins of the same family as OmpR and EnvZ, it is of interest to determine if their effect on the amount of OmpF in the outer membrane is also mediated at the level of transcription. This is assessed by assaying the beta-galactosidase activity of an ompF-lacZ operon fusion harboring strain, MH513 (26), containing various RprX and RprY encoding plasmids.

In this MH513 strain, transcription of lacZ is under control of the ompF promoter, but translation is under control of the lacZ promoter. Therefore, translation of lacZ is independent of ompF regulation. In this situation, beta-galactosidase levels reflect the relative abundance of the transcript, and thus the level of transcription.

In this beta-galactosidase assay, MH513 (26), ompF-lacZ operon fusion, and MH225 (26), ompC-lacZ operon fusion, C600 (27, New England Biolabs) cells are transformed with pCLL2300 and derivatives of pCLL2233. Two ml cultures of LB medium broth are inoculated with a transformed colony and incubated at 37° C. overnight. The overnight cultures are diluted 1:4 to 1:50 into 2 mls of LB medium and incubated for several hours at 37° C. When the cultures reach mid log phase, the cells are harvested and resuspended in two mls of Z buffer (17). One ml is removed for determination of the OD₆₀₀. The remaining cells are mixed with one drop of 1% SDS and two drops of CHCl₃. A 12-100 μl aliquot of the cell suspension is added to microtiter plate wells and the final volume adjusted to 200 μl with Z buffer. The reaction is initiated by the addition of 50 μl of 10 mg/ml ONPG (ortho-nitrophenyl beta-D-galactopyranoside) in Z buffer without added beta-mercaptoethanol. The absorbance is monitored at 405 nm using a Molecular Devices (Menlo Park, Calif.) microplate reader. The formula used to calculate the relative level of enzymatic activity is: ##EQU1##

The results of the beta-galactosidase assay are that, when MH513 harbors either the vector plasmid, pCLL2300, or any of the three plasmids on which rprY and/or rprX are inactivated, pCLL2233 IS 0.7, pCLL2233 IS 2.2, or pCLL2233 ΔHindIII, equivalent levels of beta-galactosidase enzymatic activity are detected. However, isolates harboring pCLL2233 containing wild-type rprX and rprY genes show an 8-10 fold or greater decrease in the level of beta-galactosidase enzymatic activity. One culture has no detectable beta-galactosidase activity.

Similar studies with a strain harboring an ompC-lacZ operon fusion cannot be interpreted due to the high instability of these cultures, perhaps a result of the stress resulting from their inability to compensate for the decrease in OmpF levels by synthesizing additional OmpC. As a control, studies of the effect of RprX and RprY expression on LacZ activity expressed from wild-type lacZ are investigated. The results show that there is only a less than 25% decrease in the level of enzymatic activity between extracts from cells harboring either the vector, pCLL2300, or any of the three plasmids containing an inactivated rprY or rprX gene and cells harboring plasmid pCLL2233 containing wild-type rprX and rprY genes.

The tetracycline resistance imparted upon E. coli harboring the cloned BglII fragment is not the result of any known mechanism of specific tetracycline resistance such as an efflux pump, ribosomal protection, or tetracycline inactivation (28). No DNA sequence homology between the tetracycline resistance imparting fragment and genes representing any of these mechanisms of resistance is identified. Instead, the resistance may be a secondary effect of the reduction in OmpF levels. Tetracycline resistance in gram negative bacteria can be greatly affected by changes in the outer membrane that alters its permeability. Decreases in the level of OmpF have been correlated with increases in the resistance to tetracyclines and other antibiotics (29,30). The tetracycline resistance observed here correlates with the reduction in the level of OmpF.

The effect of rprX and rprY expression on growth rates and colony size are not as easily explained. Elimination of OmpF alone has not been observed to have a dramatic effect on the growth rate when E. coli are grown in LB medium. The small colony phenotype may be the result of a more pleotrophic effect of rprY and/or rprX expression on cellular regulation. There are at least 20 known multi-component regulatory protein pairs in E. coli and there are predicted to be as many as 50 total pairs (1,2), regulating a plethora of cellular functions. Several studies have shown that these regulatory proteins can "crosstalk" (31,32). That is, the histidine protein kinase component of one regulatory system can phosphorylate the regulatory response protein of a second regulatory system. This suggests that the regulatory proteins might form a network of regulation.

The introduction of an additional set of regulatory proteins, not normally present within the cell, may interfere with the normal regulation of one or more cellular processes. The RprX and RprY proteins of this invention seem to interfere with the ability of the normal regulatory proteins to appropriately regulate their target by either altering the levels of phosphorylated versus non-phosphorylated regulatory proteins or by supplanting regulation by the normal regulating proteins and directly controlling expression of the target. The overall effect is manifested as a slow growth/small colony phenotype.

Because expression of RprX and/or RprY has a profound effect on the level of OmpF, and these two proteins are of the same regulatory protein family as EnvZ and OmpR, the normal regulators of OmpF expression, it is of interest to determine if RprX and/or RprY are "regulating" OmpF expression at the same level as EnvZ and OmpR. Normally, OmpF levels are controlled by regulating transcription of ompF.

Using an ompF-lacZ operon fusion, it is determined that expression of wild-type RprX and RprY results in an 8-10 fold or greater decrease in the level of beta-galactosidase activity, while only a slight decrease in beta-galactosidase activity is observed when LacZ transcription and translation is directed by the lacZ promoter. This strongly indicates that RprX and RprY are affecting the level of transcription initiating from the ompF promoter and that the decrease in beta-galactosidase activity is not the result of decreased translation or stability of LacZ in the rprX and rprY expressing cells.

The ability of RprX and RprY to influence OmpF and OmpC protein levels at the level of transcription suggests that they elicit regulation in E. coli. This, in conjunction with the protein sequence homology data, suggests that the two proteins are regulatory proteins and would perform a regulatory function in B. fragilis, such as regulation of expression of B. fragilis outer membrane proteins or another cellular system.

Bibliography

1. Stock, J. B. et al., Microbiol. Reviews, 53, 450-490 (1989).

2. Stock, J. B., et al., Nature, 344, 395-400 (1990).

3. Comeau, D. E., et al., J. Bacteriol., 164, 578-584 (1985).

4. Makino, K., et al., J. Mol. Biol., 190, 37-44 (1986).

5. Makino, K., et al., J. Mol Biol., 192, 549-556 (1986).

6. Stock, A., et al., Pro. Natl. Acad. Sci. USA, 85, 1403-1407 (1988).

7. Jin, S., et al., J. Bacteriol., 172, 4945-4950 (1990).

8. Weiss, V. and Magasanik, B., Proc. Natl. Acad. Sci., USA, 85, 8919-8923 (1988).

9. Weber, R. F., and Silverman, P. M., J. Mol. Biol., 203, 467-478 (1988).

10. Leroux, B., et al., EMBO J., 6, 849-856 (1987).

11. Nixon, B. T., et al., Proc. Natl. Acad. Sci. USA, 83, 7850-7854 (1986).

12. Melchers, L. S., et al., Nucleic Acids Research, 14, 993-994 (1986).

13. Trach, K., et al., J. Bacteriol., 170, 4194-4208 (1988).

14. Stock A., et al., Proc. Natl. Acad. Sci. USA, 82, 7989-7993 (1985).

15. Rasmussen, B. A., Gluzman, Y. and Tally, F. P., Antimicrob. Agents. Chemother., 34, 1590-1592 (1990).

16. Rasmussen, B. A., Gluzman, Y. and Tally, F. P., Molecular Microbiol., 5, 1211-1219 (1991).

17. Miller, J. H., Experiments in molecular genetics, page 433, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1972).

18. Sambrook, J., et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989) .

19. Sanger, F., et al., Proc. Natl. Acad. Sci. USA, 74, 5463-5467 (1977).

20. Vieira, J., and Messing, J., Methods Enzymol., 153, 3-34 (1987).

21. Gold, L. , and Stormo, G., "Translation initiation," pp. 1302-1307 in Neidhardt, F., et al., eds., Escherichia coli and Salmonella typhimurium, American Society for Microbiology, Washington, D.C. (1987).

22. Weisberg, G. W., et al., J. Bacteriol., 164, 230-236 (1985).

23. Yager, T. D. and vonHippel, P. H., "Transcription and termination in Escherichia coli", pp. 1241-1275 in Neidhardt, F. C., et al., eds., Escherichia coli and Salmonella typhimurium, American Society for Microbiology, Washington, D.C. (1987).

24. Kyte, J., and Doolittle, R. F., J. Mol. Biol., 157, 105-132 (1982).

25. Bankaitis, V. A., et al., Cell, 37, 243-252 (1984).

26. Hall, M. N., and Silhavy, T. J., Mol. Biol., 146, 23-43 (1981).

27. Bachmann, B. J., "Derivations and genotypes of some mutant derivatives of Escherichia coli K-12", pp. 1190-1219 in Neidhardt, F. C., et al., eds., Escherichia coli and Salmonella typhimurium, American Society for Microbiology, Washington, D.C. (1987).

28. Salyers, A. A., et al., Molecular Microbiol., 4, 151-156 (1990).

29. Cohen, S. P., et al., Antimicrob. Agents Chemother., 33, 1318-1325 (1989).

30. Pugsley, A. P., and Schnaitman, C. A., J. Bacteriol., 133, 1181-1189 (1978).

31. Igo, M. M., et al., Genes Dev., 3, 1725-1734 (1989).

32. Ninfa, A. J., et al., Proc. Natl. Acad. Sci. USA, 85, 5492-5496 (1988).

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 39                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2651 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GGTTGTAGCTGCCGTTACATGCCATTGACAGTTCGTCGGTCGCCTCTTGAAACCTGCTTA60                 CTCATTAACAATGATTAAAGAAAGTAGATTTTCTGAAGAGAAATCTTTAATTTTTTATTA120                AATTTGCAAACCAAAGGCATATACGTTTTGTTATAGTGGTCAGAATACGACCTAAAAAAC180                GTCTTTCGGTTAATTATAGAGAACATCCTGTTAAAACAGGTTAAGCTGTTAGGAGTGTTA240                ATTAGGGAGTGTTAATTTTGTTGCTATGAAAAAGTCAACAATCTGGATATTAGGCATTAT300                TATGGGTCTTTCCTTTCTGAGTTGGCTCTATTTACAAGTGAGCTACATCGAAGAAATGGT360                GAGGATGCGTAAAGAACAATTTAATACATCCGTGCGAAATGCTTTGTTTCAGGTTTCAAA420                GGATGTGGAGTATGATGAAACGCAACGTTGGCTGTTAGAAGACATTACTGAAGCGGAACG480                TAGAGCACTGGCTCAGTCTTCTTCTACTACCGAACAGAAAAATGGTTTGATTCAGCAATC540                GGAGCGTTATAGGTTCAAGTCACCGGACGGAACCCTGTATTCGGAGTTTGAACTAAAGAT600                GATTACCACCGAGCCGTCGAAGGTGCCCAAAGCCATGATTTCGGAGAGACATGGCCGGAA660                TACCATTCCGCAGACATCGCGAAGCTTGACCGACGCTATTAAAAATAGGTATATGTATCA720                GCGTTTCCTGTCTGACGATGTAGCTTTGCGGATGATTTACAAAGCAAGCGATAAGTCGAT780                TGGCGAACGGGTGAACTTTAAGAAGCTGGATAATTATCTGAAGTCTAACTTTATTAATAA840                TGGTATAGAGCTGCTATATCATTTTTCGGTAATCGATAAAGATGGACGTGAGGTATATCG900                CTGTTCGGATTACGAAGAGGGAGGAAGTGAGGATTCTTATACCCAACCTCTGTTCCAAAA960                TGATCCGCCTGCGAAGATGAGTATTGTGAAGGTGCACTTTCCGGGAAACAAAGATTATAT1020               CTTCGACTCGGTTAGTTTTATGATCCCTTCGATGATATTCACTTTCGTACTGTTGATTAC1080               ATTCATCTTCACTATCTACATCGTCTTCCGCCAGAAGAAGCTGACAGAAATGAAGAATGA1140               CTTTATCAACAATATGACACACGAGTTCAAGACACCGATATCTACCATCTCGCTTGCCGC1200               GCAGATGCTGAAAGATCCCGCATTCGGGAAATCACCGCAGATGTTCCAGCATATATCGGG1260               AGTCATTAATGATGAAACGAAGCGGTTGAGATTCCAGGTGGAGAAAGTTCTTCAGATGTC1320               TATGTTCGACAGACAGAAAGCAACACTGAAGATGAAAGAACTCGATGCCAATGAGTTGAT1380               TTCCGGGGTTATCAATACGTTCGCTCTGAAGGTGGAACGCTATAATGGTAAGATTACATC1440               GAACCTTGAGGCTACCAATCCTGTTATATTTGCGGACGAAATGCATATGACCAATGTGAT1500               ATTCAACCTGATGGATAACGCGGTGAAATACAAGAAGCCCGAAGAAGACCTGGTGCTCGA1560               CGTGAGAACCTGGAACGAACCCGGTAAACTGATGATTTCGATACAGGACAACGGTATTGG1620               TATTAAAAAAGAAAACCTGAAGAAGGTGTTTGATAAGTTCTATCGCTGTCATACAGGTAA1680               TCTGCACGATGTAAAAGGTTTCGGTCTGGGACTGGCTTATGTGAAAAAGATTATTCAGGA1740               TCATAAGGGAACCATCCGGGCGGAGAGTGAACTGATTGTAGGAACTAAATTTATTATTGC1800               ATTACCTTTATTAAAAAATGATTGATATGGACGAGAAACTGCGTATTTTATTATGCGAGG1860               ATGATGAAAATCTTGGCATGCTTTTAAGAGAATATTTACAGGCGAAAGGTTACTCTGCTG1920               AGTTGTATCCTGATGGAGAAGCCGGATTTAAGGCTTTCCTGAAGAATAAATATGACTTGT1980               GCGTGTTCGACGTGATGATGCCTAAGAAAGATGGTTTCACGCTGGCACAGGAGGTTCGTG2040               CGGCCAACGCTGAAATTCCGATTATCTTCCTGACTGCAAAGACACTCAAGGAGGATATTC2100               TGGAAGGATTTAAGATTGGTGCGGATGATTACATCACCAAACCTTTCAGTATGGAAGAAC2160               TTACTTTCAGAATTGAAGCGATCCTGAGACGTGTTCGTGGAAAGAAGAACAAAGAAAGCA2220               ATATCTATAAGATCGGTAAGTTTACGTTTGATACACAAAAGCAGATTCTGGCTATCGGTG2280               ACAAACAAACTAAGCTGACTACCAAGGAATCGGAATTGCTGGGATTGCTGTGTGCACATG2340               CCAACGAGATTTTGCAGCGTGACTTTGCTTTGAAGACTATCTGGATTGATGATAACTATT2400               TCAATGCCCGTAGTATGGACGTATATATCACCAAACTGCGTAAGCACCTGAAGGATGATG2460               ATTCGATTGAGATTATCAACATCCACGGAAAAGGTTACAAGTTGATTACCCCCGAACCGG2520               AATCATAATGGAGAGGGGGATATACAGAAATAAAAAAGCCGGAAAACATTCGTTTTCCGG2580               CTTTTTTATTTCTGTTGAAAATATTAATCCGCAATTCTTTTATTGATCGCAATGTAAGAA2640               ATGAGTCCGAG2651                                                                (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 519 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetLysLysSerThrIleTrpIleLeuGlyIleIleMetGlyLeuSer                               151015                                                                         PheLeuSerLeuLeuTyrLeuGlnValSerTyrIleGluGluMetVal                               202530                                                                         LysMetArgLysGluGlnPheAsnThrSerValArgAsnAlaLeuPhe                               354045                                                                         GlnValSerLysAspValGluTyrAspGluThrGlnArgTrpLeuLeu                               505560                                                                         GluAspIleThrGluAlaGluArgArgAlaLeuAlaGlnSerSerSer                               65707580                                                                       ThrThrGluGlnLysAsnGlyLeuIleGlnGlnSerGluArgTyrArg                               859095                                                                         PheLysSerProAspGlyThrLeuTyrSerGluPheGluLeuLysMet                               100105110                                                                      IleThrThrGluProSerLysValProLysAlaMetIleSerGluArg                               115120125                                                                      HisGlyArgAsnThrIleProGlnThrSerArgSerLeuThrAspAla                               130135140                                                                      IleLysAsnArgTyrMetTyrGlnArgPheLeuLeuAspAspValAla                               145150155160                                                                   LeuArgMetIleTyrLysAlaSerAspLysSerIleGlyGluArgVal                               165170175                                                                      AsnPheLysLysLeuAspAsnTyrLeuLysSerAsnPheIleAsnAsn                               180185190                                                                      GlyIleGluLeuLeuTyrHisPheSerValIleAspLysAspGlyArg                               195200205                                                                      GluValTyrArgCysSerAspTyrGluGluGlyGlySerGluAspSer                               210215220                                                                      TyrThrGlnProLeuPheGlnAsnAspProProAlaLysMetSerIle                               225230235240                                                                   ValLysValHisPheProGlyLysLysAspTyrIlePheAspSerVal                               245250255                                                                      SerPheMetIleProSerMetIlePheThrPheValLeuLeuIleThr                               260265270                                                                      PheIlePheThrIleTyrIleValPheArgGlnLysLysLeuThrGlu                               275280285                                                                      MetLysAsnAspPheIleAsnAsnMetThrHisGluPheLysThrPro                               290295300                                                                      IleSerThrIleSerLeuAlaAlaGlnMetLeuLysAspProAlaVal                               305310315320                                                                   GlyLysSerProGlnMetPheGlnHisIleSerGlyValIleAsnAsp                               325330335                                                                      GluThrLysArgLeuArgPheGlnValGluLysValLeuGlnMetSer                               340345350                                                                      MetPheAspArgGlnLysAlaThrLeuLysMetLysGluLeuAspAla                               355360365                                                                      AsnGluLeuIleSerGlyValIleAsnThrPheAlaLeuLysValGlu                               370375380                                                                      ArgTyrAsnGlyLysIleThrSerAsnLeuGluAlaThrAsnProVal                               385390395400                                                                   IlePheAlaAspGluMetHisIleThrAsnValIlePheAsnLeuMet                               405410415                                                                      AspAsnAlaValLysTyrLysLysProGluGluAspLeuValLeuAsp                               420425430                                                                      ValTyrThrTrpAsnGluProGlyLysLeuMetIleSerIleGlnAsp                               435440445                                                                      AsnGlyIleGlyIleLysLysGluAsnLeuLysLysValPheAspLys                               450455460                                                                      PheTyrArgValHisThrGlyAsnLeuHisAspValLysGlyPheGly                               465470475480                                                                   LeuGlyLeuAlaTyrValLysLysIleIleGlnAspHisLysGlyThr                               485490495                                                                      IleArgAlaGluSerGluLeuAsnValGlyThrLysPheIleIleAla                               500505510                                                                      LeuProLeuLeuLysAsnAsp                                                          515                                                                            (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 236 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        MetIleAspMetAspGluLysLeuArgIleLeuLeuCysGluAspAsp                               151015                                                                         GluAsnLeuGlyMetLeuLeuArgGluTyrLeuGlnAlaLysGlyTyr                               202530                                                                         SerAlaGluLeuTyrProAspGlyGluAlaGlyPheLysAlaPheLeu                               354045                                                                         LysAsnLysTyrAspLeuCysValPheAspValMetMetProLysLys                               505560                                                                         AspGlyPheThrLeuAlaGlnGluValArgAlaAlaAsnAlaGluIle                               65707580                                                                       ProIleIlePheLeuThrAlaLysThrLeuLysGluAspIleLeuGlu                               859095                                                                         GlyPheLysIleGlyAlaAspAspTyrIleThrLysProPheSerMet                               100105110                                                                      GluGluLeuThrPheTyrIleGluAlaIleLeuArgArgValArgGly                               115120125                                                                      LysLysAsnLysGluSerAsnIleTyrLysIleGlyLysPheThrPhe                               130135140                                                                      AspThrGlnLysGlnIleLeuAlaIleGlyAspLysGlnThrLysLeu                               145150155160                                                                   ThrThrLysGluSerGluLeuLeuGlyLeuLeuCysAlaHisAlaAsn                               165170175                                                                      GluIleLeuGlnArgAspPheAlaLeuLysThrIleTrpIleAspAsp                               180185190                                                                      AsnTyrPheAsnAlaArgSerMetAspValTyrIleThrLysLeuArg                               195200205                                                                      LysHisLeuLysAspAspAspSerIleGluIleIleAsnIleHisGly                               210215220                                                                      LysGlyTyrLysLeuIleThrProGluProGluSer                                           225230235                                                                      (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        AspIleSerHisGluLeuArgThrProLeuThr                                              1510                                                                           (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        ProAsnAlaLeuGluSerAlaLeuGluAsnIleValArgAsnAla                                  151015                                                                         (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        ValAspAspAspGlyProGlyValSerPro                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        GlyGlyThrGlyProGlyProAlaIleVal                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        AsnValSerHisGluLeuArgThrProLeuThr                                              1510                                                                           (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        GluAspGlnLeuArgSerAlaIleSerAsnLeuValTyrAsnAla                                  151015                                                                         (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       ValGluAspAsnGlyProGlyIleAlaPro                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       GlyGlySerGlyLeuGlyLeuSerIleAla                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       ArgAlaAlaHisSerIleLysGlyGlyAlaGly                                              1510                                                                           (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       ProGluLysThrLeuGluAlaGlyLysAsnValValGlyAsnLeu                                  151015                                                                         (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       ValThrAspAspGlyAlaGlyLeuAsnThr                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       SerGlyArgGlyValGlyMetAspValVal                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       GlyIleAlaHisGluPheAsnAsnIleLeuGly                                              1510                                                                           (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       ProLeuGluLeuGlnGlnValLeuIleAsnIleCysLysAsnAla                                  151015                                                                         (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       IleSerAspAsnGlyGlyGlyIleProGlu                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       GlyGlyThrGlyLeuGlyLeuAlaSerVal                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:20:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                       MetLeuAlaHisGluIleLysAsnProLeuSer                                              1510                                                                           (2) INFORMATION FOR SEQ ID NO:21:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                       GlnAspGlnLeuIleGlnValPheLeuAsnLeuValLysAsnAla                                  151015                                                                         (2) INFORMATION FOR SEQ ID NO:22:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                       ValLysAspAsnGlySerGlyValProGlu                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:23:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                       ThrGlySerGlyLeuGlyLeuAlaLeuVal                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:24:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 125 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                       MetGlnGluAsnTyrLysAsnLeuValValAspAspAspMetArgLeu                               151015                                                                         ArgAlaLeuLeuGluArgTyrLeuThrGluGlnGlyPheGlnValArg                               202530                                                                         SerValAlaAsnAlaGluGlnMetAspArgLeuLeuThrArgGluSer                               354045                                                                         PheHisLeuMetValLeuAspLeuMetLeuProGlyGluAspGlyLeu                               505560                                                                         SerIleCysArgArgLeuArgSerGlnSerAsnProMetProIleIle                               65707580                                                                       MetValThrAlaLysGlyGluGluValAspArgIleValGlyLeuGlu                               859095                                                                         IleGlyAlaAspAspTyrIleProLysProPheAsnProArgGluLeu                               100105110                                                                      LeuAlaArgIleArgProValLeuArgArgGlnAlaAsn                                        115120125                                                                      (2) INFORMATION FOR SEQ ID NO:25:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 124 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                       GluProLeuLysHisValLeuLeuValAspAspAspValAlaMetArg                               151015                                                                         HisLeuIleIleGluTyrLeuThrIleHisAlaPheLysValThrAla                               202530                                                                         ValAlaAspSerThrGlnPheThrArgValLeuSerSerAlaThrVal                               354045                                                                         AspValValValValAspLeuAsnLeuGlyArgGluAspGlyLeuGlu                               505560                                                                         IleValArgAsnLeuAlaAlaLysSerAspIleProIleIleIleIle                               65707580                                                                       SerGlyAspArgLeuGluGluThrAspLysValValAlaLeuGluLeu                               859095                                                                         GlyAlaSerAspPheIleAlaLysProPheSerIleArgGluPheLeu                               100105110                                                                      AlaArgIleArgValAlaLeuArgValArgProAsn                                           115120                                                                         (2) INFORMATION FOR SEQ ID NO:26:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 124 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                       MetProAlaGlySerIleLeuValAlaAspAspAspThrAlaIleArg                               151015                                                                         ThrValLeuAsnGlnAlaLeuSerArgAlaGlyTyrGluValArgLeu                               202530                                                                         ThrGlyAsnAlaAlaThrLeuTrpArgTrpValSerGlnGlyGluGly                               354045                                                                         AspLeuValIleThrAspValValMetProAspGluAsnAlaPheAsp                               505560                                                                         LeuLeuProArgIleLysLysMetArgProAsnLeuProValIleVal                               65707580                                                                       MetSerAlaGlnAsnThrPheMetThrAlaIleArgProSerGluArg                               859095                                                                         GlyAlaTyrGluTyrLeuProLysProPheAspLeuLysGluLeuIle                               100105110                                                                      ThrIleValGlyArgAlaLeuAlaGluProLysGlu                                           115120                                                                         (2) INFORMATION FOR SEQ ID NO:27:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 125 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                       MetAlaArgArgIleLeuValValGluAspGluAlaProIleArgGlu                               151015                                                                         MetValCysPheValLeuGluGlnAsnGlyPheGlnProValGluAla                               202530                                                                         GluAspTyrAspSerAlaValAsnGlnLeuAsnGluProTrpProAsp                               354045                                                                         LeuIleLeuLeuAspTrpMetLeuProGlyGlySerGlyIleGlnPhe                               505560                                                                         IleLysHisLeuLysArgGluSerMetThrArgAspIleProValVal                               65707580                                                                       MetLeuThrAlaArgGlyGluGluGluAspArgValArgGlyLeuGlu                               859095                                                                         ThrGlyAlaAspAspTyrIleThrLysProPheSerProLysGluLeu                               100105110                                                                      ValAlaArgIleLysAlaValMetArgArgIleSerPro                                        115120125                                                                      (2) INFORMATION FOR SEQ ID NO:28:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 124 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                       MetMetAsnGluLysIleLeuIleValAspAspGlnTyrGlyIleArg                               151015                                                                         IleLeuLeuAsnGluValPheAsnLysGluGlyTyrGlnThrPheGln                               202530                                                                         AlaAlaAsnGlyLeuGlnAlaLeuAspIleValThrLysGluArgPro                               354045                                                                         AspLeuValLeuLeuAspMetLysIleProGlyMetAspGlyIleGlu                               505560                                                                         IleLeuLysArgMetLysValIleAspGluAsnIleArgValIleIle                               65707580                                                                       MetThrAlaTyrGlyGluLeuAspMetIleGlnGluSerLysGluLeu                               859095                                                                         GlyAlaLeuThrHisPheAlaLysProPheAspIleAspGluIleArg                               100105110                                                                      AspAlaValLysLysTyrLeuProLeuLysSerAsn                                           115120                                                                         (2) INFORMATION FOR SEQ ID NO:29:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 129 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                       MetAlaAspLysGluLeuLysPheLeuValValAspAspPheSerThr                               151015                                                                         MetArgArgIleValArgAsnLeuLeuLysGluLeuGlyPheAsnAsn                               202530                                                                         ValGluGluAlaGluAspGlyValAspAlaLeuAsnLysLeuGlnAla                               354045                                                                         GlyGlyPheGlyProIleIleSerAspTrpAsnMetProAsnMetAsp                               505560                                                                         GlyLeuGluLeuLeuLysThrIleArgAlaAspSerAlaMetSerAla                               65707580                                                                       LeuProValLeuMetValThrAlaGluAlaLysLysGluAsnIleIle                               859095                                                                         AlaAlaAlaGlnAlaGlyAlaSerGlyTyrValValLysProPheThr                               100105110                                                                      AlaAlaThrLeuGluGluLysLeuAsnLysIlePheGluLysLeuGly                               115120125                                                                      Met                                                                            (2) INFORMATION FOR SEQ ID NO:30:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                       AsnMetThrHisGluPheLysThrProIleSer                                              1510                                                                           (2) INFORMATION FOR SEQ ID NO:31:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                       GluMetHisIleThrAsnValIlePheAsnLeuMetAspAsnAla                                  151015                                                                         (2) INFORMATION FOR SEQ ID NO:32:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                       IleGlnAspAsnGlyIleGlyIleLysTyr                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:33:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                                       LysGlyPheGlyLeuGlyLeuAlaTyrVal                                                 1510                                                                           (2) INFORMATION FOR SEQ ID NO:34:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 32 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                                       MetIleAspMetAspGluLysLeuArgIleLeuLeuCysGluAspAsp                               151015                                                                         GluAsnLeuGlyMetLeuLeuArgGluTyrLeuGlnAlaLysGlyTyr                               202530                                                                         (2) INFORMATION FOR SEQ ID NO:35:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 34 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:                                       SerAlaGluLeuTyrProAspGlyGluAlaGlyPheLysAlaPheLeu                               151015                                                                         LysAsnLysTyrAspLeuCysValPheAspValMetMetProLysLys                               202530                                                                         AspGly                                                                         (2) INFORMATION FOR SEQ ID NO:36:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 12 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:                                       PheThrLeuAlaGlnGluValArgAlaAlaAsnAla                                           1510                                                                           (2) INFORMATION FOR SEQ ID NO:37:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 9 amino acids                                                      (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:                                       GluIleProIleIlePheLeuThrAla                                                    15                                                                             (2) INFORMATION FOR SEQ ID NO:38:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:                                       LysThrLeuLysGluAspIleLeuGluGlyPheLysIleGlyAlaAsp                               151015                                                                         AspTyrIleThrLysProPheSerMetGluGluLeuThrPheArgIle                               202530                                                                         GluAlaIleLeuArgArgValArgGly                                                    3540                                                                           (2) INFORMATION FOR SEQ ID NO:39:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 5 amino acids                                                      (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:                                       AspXaaGlyXaaGly                                                                15                                                                             __________________________________________________________________________ 

What is claimed is:
 1. A purified isolated Regulatory protein X (RprX) protein.
 2. The RprX protein of claim 1 whose amino acid sequence is depicted in SEQ ID NO.
 2. 3. A purified isolated Regulatory protein Y (RprY) protein.
 4. The RPrY protein of claim 3 whose amino acid sequence is depicted in residues numbered 1 to 236 of SEQ ID NO.
 3. 5. The RprY protein of claim 3 whose amino acid sequence is depicted in residues numbered 4 to 236 of SEQ ID NO.
 3. 